Hierarchical RAID: Design, performance, reliability, and recovery
Authors
Abstract
Hierarchical RAID (HRAID) extends the RAID paradigm to mask the failure of whole Storage Nodes (SNs) or bricks, where each SN is a disk array with a certain RAID level. HRAIDk/l with N SNs and M disks per SN tolerates k SN failures and l disk failures per SN with Maximum Distance Separable (MDS) erasure codes, which introduce the minimum level of redundancy at each level. For N = M there are k internode and l intranode check strips per SN, occupying the capacity of as many disks, with storage redundancy (k + l)/N, but a higher storage redundancy is required for M > N. HRAIDk/l tolerates any pattern of up to dmin = (k + 1)(l + 1) − 1 disk failures, while some patterns of up to dmax = Nl + Mk − kl disk failures can be tolerated. Three options for HRAID operation are: (I) Only intranode recovery. (II) Intranode and internode recovery, with on-demand reconstruction of blocks and rebuild. (III) Multistep internode recovery with no rebuild processing. The I/Os Per Second (IOPS) metric is used to assess the cost of fault tolerance for HRAIDk/l against RAID(4+l) and RAID0, for varying k and l. The maximum IOPS is at its lowest in degraded mode, but even with fewer operational disks the normal mode IOPS may be exceeded after restriping. Asymptotic reliability analysis and simulation results show that HRAIDk/l with l > k provides a higher reliability when SN failures are due to disk rather than controller failures. Monte Carlo simulation is used to quantify the effect of the recovery options on the Mean Time to Data Loss (MTTDL), for varying k and l and as the SN controller failure rate is varied relative to the disk failure rate. The HRAID paradigm is justified by the fact that Option II attains a significantly higher MTTDL than Option I. Option III with no rebuild processing has an MTTDL exceeding Option II, but a poorer performance. © 2012 Elsevier Inc. All rights reserved.
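The closed-form quantities quoted in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper: the function names and the sample configuration (N = M = 12, k = 1, l = 2) are assumptions chosen for the example. It relies on reading dmax as k fully failed SNs plus l failed disks in each of the remaining N − k SNs, which gives kM + (N − k)l = Nl + Mk − kl.

```python
def hraid_failure_limits(n_nodes: int, m_disks: int, k: int, l: int) -> tuple[int, int]:
    """Return (d_min, d_max) for HRAIDk/l with n_nodes SNs and m_disks disks per SN.

    d_min: every pattern of up to d_min disk failures is tolerated.
    d_max: the largest number of disk failures that can be tolerated
           under the most favourable failure pattern.
    """
    if not (0 <= k < n_nodes and 0 <= l < m_disks):
        raise ValueError("need 0 <= k < N and 0 <= l < M")
    # Data loss needs at least (k + 1) SNs with (l + 1) failed disks each.
    d_min = (k + 1) * (l + 1) - 1
    # k whole SNs lost, plus l failed disks in each of the other N - k SNs.
    d_max = n_nodes * l + m_disks * k - k * l
    return d_min, d_max


def storage_redundancy_square(n: int, k: int, l: int) -> float:
    """Storage redundancy (k + l) / N, valid for the N = M case only."""
    return (k + l) / n


# Hypothetical configuration, chosen only for illustration.
d_min, d_max = hraid_failure_limits(n_nodes=12, m_disks=12, k=1, l=2)
print(d_min, d_max)                         # 5 34
print(storage_redundancy_square(12, 1, 2))  # 0.25
```

The gap between the two limits shows how much tolerance depends on where failures fall: in this example any 5 failures are survivable, whereas surviving 34 failures requires them to be spread in the most favourable way across SNs.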
Similar resources
RAID0.5: Active Data Replication for Low Cost Disk Array Data Protection
RAID has long been established as an effective way to provide highly reliable as well as high-performance disk subsystems. However, reliability in RAID systems comes at the cost of extra disks. In this paper, we describe a mechanism that we have termed RAID0.5 that enables striped disks with very high data reliability but low disk cost. We take advantage of the fact that most disk systems use b...
The Design of Large-Scale, Do-It-Yourself RAIDs
In this paper we explore the design of “Do-It-Yourself” RAIDs: RAID systems that can be assembled by the end user from commercially available disks, enclosures, cables, racks, computers, and networks. We quantitatively evaluate the tradeoffs in cost, performance, and reliability of these DIY-RAID systems. Our principal result is an architecture that scales from 10s to 1000s of disks; we demonstrat...
Scalable Reliability Modelling of RAID Storage Subsystems
Reliability modelling of RAID storage systems with its various components such as RAID controllers, enclosures, expanders, interconnects and disks is important from a storage system designer’s point of view. A model that can express all the failure characteristics of the whole RAID storage system can be used to evaluate design choices, perform cost reliability trade-offs and conduct sensitivity...
Reliability Modelling of Whole RAID Storage Subsystems
Reliability modelling of RAID storage systems with its various components such as RAID controllers, enclosures, expanders, interconnects and disks is important from a storage system designer's point of view. A model that can express all the failure characteristics of the whole RAID storage system can be used to evaluate design choices, perform cost reliability trade-offs and conduct sensitivity ...
On Design and Implementation of a Large Virtual NVRAM Cache for Software RAID
In order to overcome the small write problem in RAID5, especially software RAID5, we have designed and implemented a software RAID with a large virtual NVRAM cache under the Linux kernel. Because no additional hardware is needed to implement our write cache, we named it Virtual NVRAM Cache or VC-RAID for short. The main idea is to use a combination of a small portion of the system RAM and a log...
Journal title:
- J. Parallel Distrib. Comput.
Volume 72, Issue
Pages -
Publication date: 2012